A mask-based enhancement method for historical documents
Authors
Abstract
This paper proposes a novel method for document enhancement. The method combines two state-of-the-art filters through the construction of a mask. The mask is applied to a TV (Total Variation) regularized image in which background noise has been reduced. The masked image is then filtered by NLmeans (Non-Local Means), which reduces the noise in the text areas located by the mask. The document images to be enhanced are real historical documents from several periods, and their backgrounds contain several defects. These defects result from scanning, paper aging, and bleed-through. We assess the improvement brought by this enhancement method through OCR accuracy.
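The pipeline described in the abstract (TV regularization, mask construction, then NLmeans on the masked image) can be sketched roughly as follows. This is a minimal illustration under stated assumptions, not the authors' implementation: the abstract does not say how the mask is built, so the mid-range threshold on the TV image is a guess, as are the function names (`tv_denoise`, `nl_means`, `enhance`), the parameter values, and the premise of dark text on a light background with grey levels in [0, 1]. The TV step uses Chambolle's projection algorithm and the NLmeans step is a small brute-force version, both written in plain NumPy.

```python
import numpy as np

def grad(u):
    """Forward differences; the last column/row difference is zero."""
    gx = np.diff(u, axis=1, append=u[:, -1:])
    gy = np.diff(u, axis=0, append=u[-1:, :])
    return gx, gy

def div(px, py):
    """Backward differences; negative adjoint of grad() as long as the last
    column of px and last row of py are zero (true in tv_denoise below)."""
    return np.diff(px, axis=1, prepend=0.0) + np.diff(py, axis=0, prepend=0.0)

def tv_denoise(f, weight=0.1, n_iter=200, tau=0.125):
    """Chambolle's projection algorithm for the ROF total variation model."""
    px = np.zeros_like(f)
    py = np.zeros_like(f)
    for _ in range(n_iter):
        gx, gy = grad(div(px, py) - f / weight)
        norm = np.sqrt(gx ** 2 + gy ** 2)
        px = (px + tau * gx) / (1.0 + tau * norm)
        py = (py + tau * gy) / (1.0 + tau * norm)
    return f - weight * div(px, py)

def box_mean(a, r):
    """Mean over a (2r+1) x (2r+1) window, with reflected borders."""
    p = np.pad(a, r, mode="reflect")
    rows, cols = a.shape
    out = np.zeros_like(a)
    for dy in range(2 * r + 1):
        for dx in range(2 * r + 1):
            out += p[dy:dy + rows, dx:dx + cols]
    return out / (2 * r + 1) ** 2

def nl_means(img, patch=1, search=3, h=0.1):
    """Brute-force non-local means: average pixels whose patches look alike."""
    pad = np.pad(img, search, mode="reflect")
    rows, cols = img.shape
    acc = np.zeros_like(img)
    wsum = np.zeros_like(img)
    for dy in range(-search, search + 1):
        for dx in range(-search, search + 1):
            shifted = pad[search + dy:search + dy + rows,
                          search + dx:search + dx + cols]
            d2 = box_mean((img - shifted) ** 2, patch)  # patch distance
            w = np.exp(-d2 / h ** 2)
            acc += w * shifted
            wsum += w
    return acc / wsum  # wsum >= 1 because the zero shift has weight 1

def enhance(img, weight=0.1, h=0.1):
    img = np.asarray(img, dtype=float)
    tv = tv_denoise(img, weight)             # flattened background
    thresh = 0.5 * (tv.min() + tv.max())     # ASSUMPTION: mid-range threshold
    mask = tv < thresh                       # ASSUMPTION: dark text pixels
    masked = np.where(mask, img, tv)         # original text on TV background
    return nl_means(masked, h=h), mask

# Synthetic "page": light background, one dark bar standing in for text.
rng = np.random.default_rng(0)
page = np.full((32, 32), 0.9)
page[12:20, 6:26] = 0.1
noisy = page + rng.normal(0.0, 0.05, page.shape)
enhanced, mask = enhance(noisy)
print("background std before:", noisy[page > 0.5].std())
print("background std after: ", enhanced[page > 0.5].std())
```

On real scans, `weight`, `h`, and the thresholding rule would all need tuning per collection, and library implementations of both filters would be far faster than these loops.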
Similar resources
Enhancement of historical printed document images by combining Total Variation regularization and Non-local Means filtering
This paper proposes a novel method for document enhancement which combines two recent powerful noise-reduction steps. The first step is based on the total variation framework. It flattens background grey-levels and produces an intermediate image where background noise is considerably reduced. This image is used as a mask to produce an image with a cleaner background while keeping character deta...
A Novel Thresholding Method for Text Separation and Document Enhancement
Many thresholding-based image enhancement techniques have been developed and used for document analysis, where the simplicity and efficiency of thresholding makes it ideal to use for classifying layers within documents. However, the efficiency of these enhancement techniques can be impaired by the variation of grey levels in different documents, thus causing over-thresholding or under-threshold...
Degraded document image enhancement
Poor quality documents are obtained in various situations such as historical document collections, legal archives, security investigations, and documents found in clandestine locations. Such documents are often scanned for automated analysis, further processing, and archiving. Due to the nature of such documents, degraded document images are often hard to read, have low contrast, and are corrup...
The Development of "Naqsh-e Jahan" Square in Isfahan
Despite numerous studies regarding the development history of Naqsh-e Jahan Square, there are still many questions which have not been accurately answered to date. Some of them include the history of the square, the exact date of initiation and completion of construction of different elements of the square, and the order of their completion. This article tries to answer these questions accurate...
An Adaptive Method for Physical Documents Digitization based on Global Energy Function Parameter
The first step of a physical document analysis system is to digitize the physical document. Recently, a number of researchers have presented numerous techniques that vary in sensitivity, quality, and other control parameters. This paper presents a three-tier framework for physical document digitization and describes an automatic technique for document digitization that can significantly increase the...